Class constrained ROVER based speech enhancement

نویسندگان

  • Amit Das
  • John H. L. Hansen
چکیده

A phoneme class based speech enhancement algorithm is proposed that is derived from the family of constrained iterative enhancement schemes. The algorithm is a Rover based solution that overcomes three limitations of the iterative scheme. It removes the dependency of the terminating iteration, employs direct phoneme class constraints, and achieves suppression of audible noise. In the Rover scheme, the degraded utterance is partitioned into segments based on class, and class specific constraints are applied on each segment using a hard decision method. To alleviate the effect of hard decision errors, a GMM based maximum likelihood (ML) soft decision method is also introduced. Performance evaluation is done using Itakura-Saito, segSNR, and PESQ metrics for four noise types at two SNRs. It is shown that the proposed algorithm outperforms other baseline algorithms like Auto-LSP and log-MMSE for all noise types and levels and achieves a greater degree of consistency in improving quality for most phoneme classes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Frequency Domain Linearly Constrained Minimum Variance Filter for Speech Enhancement

A reliable speech enhancement method is important for speech applications as a pre-processing step to improve their overall performance. In this paper, we propose a novel frequency domain method for single channel speech enhancement. Conventional frequency domain methods usually neglect the correlation between neighboring time-frequency components of the signals. In the proposed method, we take...

متن کامل

Decision directed constrained iterative speech enhancement

Earlier studies have shown that degradation due to environmental background noise is non-uniform across various phoneme classes of speech. In this study, we present an improved formulation of single channel constrained iterative speech enhancement (AutoLSP) that follows a rover based paradigm. The new approach overcomes some of the drawbacks observed earlier in the baseline AutoLSP system. Firs...

متن کامل

Markov Model Based Phoneme Class Partitioning for ImprovedConstrained Iterative Speech

Research has shown that degrading acoustic background noise innuences speech quality across phoneme classes in a non-uniform manner. This results in variable quality performance of many speech enhancement algorithms in noisy environments. A phoneme classiication procedure is proposed which directs single-channel constrained speech enhancement. The procedure performs broad phoneme class partitio...

متن کامل

ROVER Enhancement with Automatic Error Detection

In this paper, an approach is presented to improve the existing performance of the Recognizer Output Voting Error Reduction (ROVER) procedure used for speech decoders’ combination in automatic speech transcription. A contextual analysis is injected within the ROVER process to detect and eliminate erroneous words. This filtering is carried out through the combination of automatic error detection...

متن کامل

Low-dimensional recurrent neural network-based Kalman filter for speech enhancement

This paper proposes a new recurrent neural network-based Kalman filter for speech enhancement, based on a noise-constrained least squares estimate. The parameters of speech signal modeled as autoregressive process are first estimated by using the proposed recurrent neural network and the speech signal is then recovered from Kalman filtering. The proposed recurrent neural network is globally asy...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007